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Abstract: We give the detailed derivation of the fully covariant form of the quadratic 
action and the derived linear equations of motion for a massive graviton in an arbitrary 
background metric (which were presented in arXiv: 1410.8302 [hep-th]). Our starting point 
is the de Rham-Gabadadze-Tolley (dRGT) family of ghost free massive gravities and using a 
simple model of this family, we are able to express this action and these equations of motion 
in terms of a single metric in which the graviton propagates, hence removing in particular 
the need for a ’’reference metric” which is present in the non perturbative formulation. We 
show further how 5 covariant constraints can be obtained including one which leads to the 
tracelessness of the graviton on flat space-time and removes the Boulware-Deser ghost. This 
last constraint involves powers and combinations of the curvature of the background metric. 
The 5 constraints are obtained for a background metric which is unconstrained, i.e. which 
does not have to obey the background field equations. We then apply these results to the case 
of Einstein space-times, where we show that the 5 constraints become trivial, and Friedmann- 
Lemaitre-Robertson-Walker space-times, for which we correct in particular some results that 
appeared elsewhere. To reach our results, we derive several non trivial identities, syzygies, 
involving the graviton fields, its derivatives and the background metric curvature. These 
identities have their own interest. We also discover that there exist backgrounds for which 
the dRGT equations cannot be unambiguously linearized. 
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1 Introduction 


The last 15 years have seen several developments about massive gravity originating in the 
DGP model [1] and its ability to produce a late time acceleration of the Universe via a large 
distance modification of gravity [2, 3]. One of the main motivations for massive gravity 
(see [4-6] for recent reviews) is indeed the wish to replace dark energy by a non vanishing 
graviton mass, and to give a mass to the graviton is one of the simplest (conceptually speaking) 
ways to modify gravity in the Infra-Red. In fact, there has been many attempts to write a 
consistent theory for a massive graviton since the invention of general relativity and various 
problems have been encountered on this road. The unique consistent theory for a free (non 
self interacting) massive spin-2 field on a Minkowski space-time has been known for a long 
time as the Fierz-Pauli theory [7]. It propagates 5 degrees of freedom of positive energy, those 
of a transverse, traceless, symmetric, two times covariant tensor. However, one of these d.o.f., 
a scalar mode, leads to the well known van-Dam Veltman Zakharov discontinuity: namely 
the fact that, however small the graviton mass, Fierz-Pauli theory leads to different physical 
predictions (such as light bending) from those of linearized General Relativity [8]. This is 
enough to rule out Fierz-Pauli theory stricto sensu from solar system experiment. However, a 
way out was suggested by Vainshtein [9] considering self-interactions of the massive graviton. 
This so-called ’’Vainshtein mechanism” was criticized soon after it was proposed by Boulware 
and Deser in the seminal paper [10] and it was only recently that it was actually proven to 
operate as predicted [11-14] (see also the review [15]). Boulware and Deser also discovered 
a new pathology of generic massive gravities: the propagation of a ghost-like 6th degree of 
freedom at the nonlinear level. It was long thought impossible to obtain a massive gravity 
theory devoid of this ghost (see e.g. [10, 16]). However, a family of massive gravity theories 
was recently proposed by de Rham, Gabadadze, and Tolley (dRGT in the following) [17-19] 
in which the absence of ghost was first addressed in the so-called decoupling limit [18] (using, 
in particular, the approach of Refs. [16, 20, 21]) and then fully shown at the nonlinear level 1 
by a Hamiltonian analysis, later extended to bimetric theories [24-27] (see also [28, 29]). 
The Hamiltonian analysis of these models remains however involved and does not clarify the 
reasons behind their soundness. The original discussions of the dRGT models have been 
using a metric formulation, however it has been pointed out that a vierbein formulation (first 
obtained in [30] building in particular on the older work of ref. [31]) makes easier the obtention 
of the constraint necessary to remove the Boulware Deser ghost [30, 32] 2 while it also makes 
the theory more elegant by the removal of an unpleasant matrix square root. Note however 
that the two formulations are not totally equivalent [38, 39] and we will consider here only 
the metric formulation. 

The aim of this paper is twofold. First, starting from the dRGT models, we provide the 
detailed derivation of the action and derived field equations of a massive graviton with (at 
most) 5 polarizations on an arbitrary background. The obtained theory is written in a fully 

1 Note, however, that these results were initially debated [22, 23]. 

2 See also [33-37], 
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covariant way and has been presented in Ref. [40]. In particular, we will, for a subset of dRGT 
models, show how the extraneous reference metric (usually taken to be Minkowski) can be 
fully integrated out, leaving only one metric which serves as a background for the graviton 
propagation. The obtained equations contain curvature tensors of the background metric 
entering into the mass term in a non trivial way. Secondly, we provide a full Lagrangian 
covariant proof that the graviton we consider has always , i.e. on arbitrary backgrounds, at 
most five polarizations 3 4 and not 6 as could be expected from the Boulware-Deser argument. 
To do so, we show how the field equations lead to 5 covariant constraints like in the long known 
case of Fierz-Pauli theory over flat space-time or Einstein space-times [20, 43-47]. Out of these 
constraints, four are easy to find, just coming from the Bianchi identities associated with the 
diffeomorphism invariance of the kinetic terms. The fifth constraint is a scalar constraint 
necessary to eliminate the Boulware-Deser ghost and, in the Fierz-Pauli case, leads to the 
tracelessness of the graviton field. We will show that such a constraint can always be obtained 
irrespectively of the background metric, i.e. even when this background metric does not solve 
the background field equations. Such a constraint has been obtained in the generic case in the 
vierbein formulation of dRGT (with a Minkowski reference metric though) where it appears 
surprisingly simple, at least in the simplest models of the dRGT family. As we will see, the 
derivation of the similar constraint in the metric formalism and from linearized held equations 
is much more involved and requires several non trivial technical tricks. In particular, to reach 
our results, we derive several non trivial identities - syzygies - involving the graviton held, its 
derivatives and the background metric curvature. These identities have their own interest. We 
also discover that there exist backgrounds for which the equations cannot be unambiguously 
linearized. 

This paper is organized as follows. In the following section we introduce massive gravity 
and covariant d.o.f. counting. Section 3 summarizes the main results of the paper as well 
outlines, without giving the technical details, the general reasoning and technical steps. The 
detailed proof is given in the following two sections, while some long formulae and technical 
steps are differed to appendices. Section 6 discusses some simple applications of the obtained 
results: hat and Einstein space-times as well as FLRW space-times. A last section contains 
conclusions. 

2 Introduction to massive gravity and dRGT theories 
2.1 Fierz-Pauli theory 

Let us hrst introduce the (quadratic) Fierz-Pauli theory [7] on a hat background. This theory 
can be defined 1 by the following action (in the absence of matter source) for a rank-2 covariant 

3 It is known that some polarizations can be prevented to propagate depending on the background [41, 42] 

4 For simplicity, we only discuss in this section the case with d = 4 dimensions, but such a theory can 
be introduced in a similar way for arbitrary d, where it is found that a massive graviton in d dimensions, as 
described by the quadratic Fierz-Pauli theory, has (d — 2)(d+l)/2 physical polarizations, i.e. the same number 
of polarizations as a massless graviton in d + 1 dimensions. 
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tensor h 


fLV 


s h ,m = -Ml J d A x [{d p h up ) 2 - (d p h) 2 + 2 {d p h) (d v K) - 2 (d p h up ) (d u h pp ) 

+fh 2 (h^h^ - h 2 )] . 


( 2 . 1 ) 


Here Mh is a mass parameter, all indices are moved up and down with a flat canonical 
Minkowski metric 77 ^, and h is defined by h = h^rj^. The terms with derivatives on the 
right hand side of Eq. (2.1) are obtained by expanding the Einstein-Hilbert Lagrangian 
density y/—gR(g ) at quadratic order around a flat metric, writing g pu = rj pu + h pv . The mass 
terms appear with a factor of fh 2 on the right hand side of Eq. (2.1), and this particular 
combination of h? and (h pv ) 2 is the only one able to give a mass to the graviton h fW in a 
consistent and ghost-free way. Note that this theory explicitly breaks general covariance and 
also that it uses two rank -2 covariant tensors, h pv as well as rj pu which serves as a background 
on which h pu propagates. Varying the above action (2.1) with respect to h pv one easily 
obtains the field equations in vacuum 


8E pv = 0 , (2.2) 

where 5E pu is defined by 
_ 9 

— Tfl 

^E pu = 5Q pv -| — (h pi , — h T/fiu) (2-3) 

L 777,2 

= ~ 2 (d/Ah + Uhn V - d p d p h p v - d p d v h p p + - Dh)) + — {h pu - h g pu ) . 

Here 8Q pi/ is just the linearized Einstein tensor and hence, as a consequence of Bianchi iden¬ 
tities, its divergence d p vanishes identically. Thus, acting with d p on Eq. (2.2), we get (for a 
non vanishing fh) 


d p h pi/ - d u h = 0 . (2.4) 

Taking another derivative of this equation yields 

d v d p h pv -Uh = 0, (2.5) 

where the combination appearing in the left hand side above happens to be just the lineariza¬ 
tion of the Ricci scalar. This combination hence also appears taking the trace of the tensor 
8Q pl , which appears in the field equations. This means that one has the following fundamental 
identity (where appropriate coefficients have been included) 

2 8 p d v 8E pv + m 2 rf v 8E pv ~ 0 , (2.6) 

where here and in the rest of this paper two expressions separated by the symbol are 
by definition equal off-shell (i.e. without using the field equation) up to (possibly vanishing) 
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terms containing undifferentiated or once differentiated h /w but no second or higher derivatives 
acting on h^ v . Explicitly one has off-shell 

2d^d u 5E^ + m 2 rf v 8E lu , = ~m 4 h . (2.7) 

Now the left hand side of Eq.(2.6) (or (2.7)) vanish on-shell which in turn yields a constraint 

on specifically that h vanishes in vacuum. Thus, using also (2.4), we conclude that 
is transverse and traceless in vacuum, i.e. that it obeys the two equations 

^ V = 0 , (2.8) 

h = 0. (2.9) 

These two equations give 5 Lagrangian constraints (being at most first order in derivatives) 
and this removes 5 of the a priori 10 dynamical degrees of freedom of h ILl/ . Hence, quadratic 
Fierz-Pauli theory propagates 5 polarizations. A similar conclusion can also be reached using 
a Hamiltonian counting (see e.g. [10]). Here, we stress that the crucial step to reach this 
conclusion is the identity (2.6) which would not hold for any other mass term than the one 
of Fierz-Pauli. On the other hand, the four constraints (2.4) are consequences of the Bianchi 
identities leading to 


fESE^ ~ 0 . ( 2 . 10 ) 

And hence, constraints analogous to (2.4) can always be found. Note further, that Fierz-Pauli 
theory and the above described way of counting its degrees of freedom can easily be extended 
to the case of a background metric describing a maximally symmetric space-time with a non 
vanishing curvature [46] or even a generic Einstein space-time [47]. 

2.2 Nonlinear massive gravity 

In order to go beyond Fierz-Pauli, this needs to be suitably nonlinearly extended. Such a 
nonlinear extension can generically be formulated with an action of the form 

S g , m = Mg J d A xy/\g\ [iR(g ) - 2 m 2 V (5)] , (2.11) 

where V is a suitably chosen scalar function of fow, fn and M g are mass parameters, 

R(g ) is the Ricci scalar constructed from the metric g^, and the theory contains, besides 
the dynamical metric g^, a non dynamical metric usually considered to be flat, and 
which appears in the action only through the tensor Note that since in the considered 
theory space-time is endowed with two metrics, there is some ambiguities on how to move 
indices. In the rest of this paper, all indices will always be moved up and down with the 
dynamical metric g gv . If one wants to consider a ” nonlinear massive gravity” stricto sensu 
(i.e. an nonlinear extension of Fierz-Pauli theory) the potential V should be chosen such 
that (i) g^ = rj^ is a solution of the field equations (irrespectively of the choice of f^), and 
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(ii) when expanded at quadratic order around this flat background, the action (2.11) has the 
Fierz-Pauli form (2.1) 5 . It is easy to figure out that there are infinitely many functions V 
that satisfy these requirements (see e.g. [48]). The dRGT theory has an action of the above 
form (2.11) depending (in 4 dimensions) on 4 reals parameters /3 n , a subset of which can be 
chosen in order to fulfill the conditions (i) and (ii) above. However, we stress that our results 
will be true whether or not this fulfilment is chosen, and hence do not only apply to nonlinear 
massive gravity stricto sensu as it is defined above, but to larger set of theories. 

As first noticed by Boulware and Deser [10], quadratic Fierz-Pauli theory (2.1) and its 
nonlinear version (2.11) differ in general dramatically as far as the number of propagating 
degrees of freedom is concerned. Indeed, a generic nonlinear massive gravity as defined in 
(2.11) propagates an additional sixth ghost-like polarization, usually called the Boulware- 
Deser ghost. Schematically, this comes from the fact that the analog of the constraint (2.9) is 
lost, while there are still 4 constraints similar to (2.8) (or (2.4)) (see e.g. [21]). Indeed, now 
vary action (2.11) with respect to to obtain the held equations, 

= Gf_ iu + m? V^ u = 0 , ( 2 . 12 ) 

where the Einstein tensor built from the metric g MJ ,, and the interaction contribution 
are both defined through 

C - P 1 p V ~ ~ 2 (n ^ 

y t w — I'liv ’ v > w — Sgi w ' (^■ i 'P 

V^ u is obtained from varying the term coupling the two metrics y/—gV($), and hence contains 
no derivatives. As a consequence of that and of the usual Bianchi identities we have that 

V^-O, (2.14) 

where is the covariant derivative built from the metric g^, u . The above equation, yields, 
taking into account the held equations, the four Lagrangian constraints (analogous to (2.8)) 

V> l E, lv = m 2 V'%„ = 0. (2.15) 

However, on tracing over Eq.(2.12) and using derivatives of Eq.(2.15), there is now (as opposed 
to linear Fierz-Pauli theory) no reason in general to get an extra constraint (cf. also [21]). 
This reflects the fact that it was thought impossible to construct a nonlinear Fierz-Pauli 
theory, with a suitable potential V, devoid of the Boulware Deser mode [10, 16] until the 
discovery of de Rham-Gabadadze-Tolley (henceforth dRGT) theories that we now introduce, 
introducing also some notations used in later sections. 

5 Note that when one has two metrics, one can write any non trivial non derivative invariant built from the 
metrics as a function of 5, and hence the only restriction (besides diffeomorphism invariance) comes here from 
requirements (i) and (ii) on V. 
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2.3 dRGT theories 

dRGT theories are nonlinear Fierz-Pauli theories (2.11) for which the function V takes a 
special form. Using the parametrization of dRGT theories proposed in Refs. [27, 49], the 
action is containing a square root S of the tensor $ defined as 6 

S» a S? v = <rfa V = W v , (2.16) 


and can be written, 

S g , m = Mg f d 4 xVW\ [R(g) - 2rn 2 V (S; f3 n )\ , (2.17) 


where the potential reads 


3 


V(S-/3 n ) = Y,Pne n (S), 

n=0 


(2.18) 


where the [3 n are 4 dimensionless parameters and e n (S) are n’th order elementary symmetric 
polynomials of the eigenvalues of their matrix argument. For a generic d x d matrix M, e n (M) 
is given by, 

e n (M) = ^r [ai ...M“" ttn] , (2.19) 

where the brackets [ ] over indices denote the sum over unnormalized antisymmetrized per¬ 
mutations. Note that in d dimensions, e n (S) = 0 for any n > d while ed(S ) = det(S). They 
can also be constructed iteratively starting from eg {S) = 1 and using the recursive definition, 


1 

e n (S) = — V(-l) k Tr[S k ]e n - k ($), n>l, (2.20) 

n z ' 

k=l 


with Tr[X] = X p p indicating a matrix trace operation, and S k is the k-th power of the tensor 
S (considered as a matrix), i.e. 



Qfl QPl Qpk — 1 

° pi ° p2 ” ' ° ^ 


( 2 . 21 ) 


We stress here that above and in the rest of the paper, we consider at various places matrix 
operations (such as the elevation to a power as above) as applied to a tensor such as S, 
which, when we do so, must be taken to have one contravariant space-time index (which then 
corresponds to the line index of the matrix) and one covariant index (corresponding to the 
column of the matrix). 

An alternative formulation to action (2.17) is to consider S as an independent field and 
to enforce the relation (2.16) by a Lagrange multiplier , adding to the Lagrangian a term 
of the form (in the spirit of e.g. Ref. [17]), 

Vi J\C^(S p a S\-g^f^) . (2.22) 

6 This is a matrix square root. In fact a tensor like 5 can have finitely many, infinitely many, or no square 
roots. See e.g. [38]. 
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This alternative does not feature the presence of the unpleasant square root in the action. 
Whichever way is chosen, the presence of the square root either directly in the action or 
via a Lagrange multiplier is a somewhat inelegant aspect of the theories considered which 
also introduces technical and conceptual difficulties. A vielbein formulation of these theories 
[30] (or at least of a subset of them) offers a nice alternative which does not suffer from the 
same lack of elegance (even though the two formulations do not lead to equivalent theories 
[38, 39]). For the purposes of this paper we will however work in the metric formulation, 
using the action (2.17) and assume the existence of a tensor S obeying (2.16). 

As a step to derive the field equations, but also for a later use, we can express the variation 
8e n (S) of the e n (S) in terms of that of S ' 2 = £ noted 5S 2 = 5$ 

1 n 

5e n (S) = -- ^(-l) fc Tr[5 fc - 2 ^ 2 ] e n _ k (S ), n > 1 , (2.23) 

Z k =1 

while obviously 5eo(S) = 0 (since eo (S) = 1). This expression is obtained varying the recursive 
relation (2.20) w.r.t. S and a detailed derivation of it is given in appendix A. It holds provided 
that S is invertible, which is always true whenever / is (g having obviously an inverse). The 
interest of this expression is that 5S 2 can easily be obtained in terms of the variation of the 
metric (or its inverse) - see the definition (2.16). Using (2.23), it is easy to work out the 
detailed form of the interaction contribution, 

3 n 

£(-l)” + V„[S"-*]VhS), (2-24) 

n= 0 k =0 

which enters into the field equations (2.12). In the original construction of [17-19] the ref¬ 
erence metric was taken to be a flat Minkowski metric, i.e. = T] tw . Out of the 4 j3 n 
parameters, one parameter was also fixed such that g lw = was a valid solution to the 

equations of motion and another was chosen such that m 2 corresponded to the Fierz-Pauli 
mass for the fluctuations h jw = g tw — g liv (condition (i) and (ii) given before). The choice 
of trading one j3 n parameter for the Fierz-Pauli mass can be made without loss of generality 
because it simply amounts to an overall scaling of the parameters, but the other fixing is more 
restrictive on the theory. However, it was shown in [26], using an ADM parametrization, that 
the choice of could be relaxed and that the theory is free of the Boulware-Deser 

mode for arbitrary f /w ." This allows to get flat space-time solutions for g^ v without having 
to fix any /3 n as can be seen by considering solutions of the form = c 2 g tw . In this case 
the interaction term (2.24) reduces to a cosmological constant, 

Vfiv = A g^ , A = /3 0 + 3c/3i + 3c 2 /?2 + c 3 /3 3 . (2.25) 

Hence, it is generically possible to set A = 0 and thus have flat g^ u solutions without fixing a 
parameter of the theory but instead allowing for c / 1 [59] . This leaves three free interaction 

7 The dRGT theories seem to suffer from other problems though, ranging from difficulties in agreeing with 
cosmological observations to generic instabilities, hidden strong coupling and causality issues [37, 41, 42, 50-58]. 



parameters as opposed to the two appearing in the original dRGT construction. Note however 
that, as can easily be seen from (2.18) or (2.24), fa simply parametrizes a bare cosmological 
constant, which by itself does not give any mass to the graviton. 


2.4 The "fa model” of dRGT family 

In the following, we will consider the simple case where fa and fa vanish but will keep 
otherwise fa and fa arbitrary as well as the metric fp V (which will in particular not assumed 
to be covering a Minkowski space-time). We will call the corresponding model, the "fa 
model”, since for it to be non trivial, fa needs to be non vanishing which we will also assume 
(but no further assumption will be made on fa), otherwise one is left with a mere cosmological 
constant. As we just discussed, a generic fa model will not necessarily fulfill conditions (i) and 
(ii) above (e.g. in the case where fa vanishes in addition to fa and fa, there is no solution of 
the field equations where gp V is flat irrespectively of the choice of fpv), our results will hence 
be valid for theories not in the category of ’’nonlinear massive gravity” stricto sensu. For the 
/ 3± model, the field equations are given by 

+ m 2 \fa + fa g^p (ei (S)S P - S p „)} = 0, (2.26) 

where we recall that e\ (S) is just the trace of S, i.e. e\ (S) = S p p . We observe that these 
equations are linear in S , so by tracing the equations we find that, 

— R + m 2 [4/3 0 + 3 fa ei(S)] = 0 , (2.27) 


which we may use in turn to get the following expression of S in terms of Ricci curvatures of 


9fiv- 


S p 


1 

fa m 2 



:S P R 



(2.28) 


Since S p a S a u = g pa fw, this can equivalently be seen (by squaring the above equation) as 
expressing fp U in terms of g and its Ricci curvatures (see eq. (4.5)). This remarkable 
feature means that later on, in our study of perturbation theory, we will be able to eliminate 
any and all occurences of the auxilliary metric f fJV in favor of g^ v and its curvature using the 
above equations. The resulting perturbative equations will then describe the dynamics of a 
spin-2 field propagating on a generic background gp U without any need for a second reference 
metric. This feature is unique to the fa model and only requires a non-vanishing fa. 


3 Outline of the general reasoning and results 

Before going through the technical details of what is a rather cumbersome analysis we will first 
sketch the general procedure and technical steps of our reasoning, stressing in particular where 
the technical difficulties arise. This will hopefully help in making the subsequent analysis more 
transparent and easier to follow, as well as allow the reader to have a first overview of our 
achievements without having to enter into the details. For the sake of simplicity, we discuss in 


9 






this section only the j3i model, however some of our results presented later are more general 
and apply to an arbitrary model in the dRGT family. 

We start from the field equations of the model (i.e. Eq. (2.12), with V /w given in (2.24) 
with @2 and vanishing), 

E pu = Gpu + m2 + m2 [A) 9nv + h 5mp (ei(S')<5£ ~ S P v)} = 0 • (3.1) 


Then, we linearize these equations around some solution for g pu , defining h pv as the hrst 
order perturbation of the metric g (the metric f iw being considered hxed, as usual). The 
obtained equations read (with obvious notations) 


bE pu = 5G pu + m 2 5V pi , 


V a + m' 


M pa 


hpa — 0 , 


(3.2) 


where £ pu p<7 h pa gives by dehnition the linearized Einstein tensor 5G P u and M. pu pcT h pa the 
linearized interaction term bV pv . Obviously, bQ pv is easy to obtain. The difficult step consists 
in getting the ’’mass term” A ip,v pa h pa and our hrst achievement is to provide here an explicit 
and covariant form for this term. 

In order to do so, a hrst necessary step consists in expressing the variation of S, 5S, in 
terms of h /w . since this term appears in the linearization of the held equation (3.1) above . 
The corresponding expression is given in Eq. (4.13). To do so, one can notice that 5S obeys 
the equation (see (2.16)) 8 


S p u ( 5S)\ + ( 5SY V S\ = 5 r a ■ (3.3) 

which can be considered as a matrix equation in the class of the so-called Sylvester equation 
(see appendix B), and where the right hand side is easy to get in terms of h pu . A known 
mathematical results is that this equation has a unique solution for 5S if and only if the 
spectrum of S p u and —S^u do no intersect (which is in general the case here). In this case, 
one can express the solution for 5S linearly in terms of 5$, see e.g. [61]. We will here 
rederive this result and in fact provide an alternative expression to the one we know from the 
mathematical literature [61]. 

Having obtained the linearized field equations (3.2), it is possible, as we saw in the 
previous section, to replace there any occurence of f pv (or equivalently any occurence of S, 
since in fact / only enter the field equation in the term M. p u pa h pa through S ) by an expression 
containing only g fw and its curvature. The possibility to do this replacement singles out the j3\ 
model as rather special and is, apart from the mere simplicity of the model, a reason for why 
we have focussed on this model in the present work. Doing so we obtain the field equations 
for a graviton h pv which propagates in a single metric g pu entering into its kinetic term but 
also in its mass term where the Ricci curvatures of g explicitly appear. These equations can 
be considered as the starting point of the second part of our work which consists in proving 

8 This equation has also been considered in Ref. [60] for the study of perturbations over the Schwarzschild-de 
Sitter black hole. 
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covariantly that these equations propagates only at most 5 polarizations of v irrespectively 
of the metric g. In particular, we stress that in this proof, summarized in the rest of this 
section, we will not use the fact that g is a solution of the background equations (3.1). 

As outlined in section 2 it is easy to find four vector constraints stemming from the fact 
that 'S7^5Q I1V ~ 0 as a consequence of the Bianchi identity (where V denotes the covariant 
derivative taken with respect to the background metric g^ w and the exact expression for 
LV is given in Eq. (5.3)). In analogy with the flat space case considered in section 2 we 
are interested in finding a fifth scalar constraint which generalizes the constraint h = 0 for 
the flat space case (c.f. Eq. (2.9)), as opposed to the vector constraints 

V' l 8E flu = 0, (3.4) 

which in turn generalize Eq. (2.8). Accordingly, we look for a linear combination of scalars 
made by tracing over the field equations (3.2) and their second derivatives which would not 
contain any derivatives of h^ w of order strictly higher than one. However, we have now at 
hand two (symmetric) tensors which can be use to take traces, namely the metric g liu and its 
Ricci curvature R fiu . Equivalenty we could also use the metric and the tensor S^, trading 
for via equation (2.28). We will in fact choose the second possibility which turns out 
to be more convenient technically speaking. We stress however that the two possibilities are 
strictly equivalent and do not impose any restriction on g, since (2.28) can just be considered 
as a definition of S lw in term of R, IW (as opposed to a background field equation). Hence we 
define the scalars •hj arising out of tracing the equations of motion 

<h 4 = [^r^, ( 3 . 5 ) 

together with the scalars T* arising out of tracing the derivative of the divergence of the 
equations of motion in various ways, 9 

% = ±[S i rV„V x 6E x ,. (3.6) 

where the tensor S l is defined as in Eq. (2.21). These scalars actually exhaust all possibilities. 
In fact this set of scalars can be further restricted by noticing that any power of S, S l , with 
i > 4 can be replaced by a linear combination of lower powers of S via the use of the Cayley- 
Hamilton theorem, as will be explained with more details below. To summarize we look for 
a specific linear combination, 

3 

(ui + Vi Tj) , (3.7) 

i= 0 

with a set of seven 10 scalars {ui,Vi} to be determined, such that 

3 

^2 {Ui$i + Vi^i) ~ 0, (3.8) 

i =0 

9 The i = 0 case includes the ordinary trace obtained by contracting the free indices together using g as 
a raising operator. 

10 There are eight scalars {ui,Vi}, but they need only to be determined up to an overall factor. 
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i.e. which contains off-shell no second (or higher) derivatives of h IJV . In the following, we will 
hence concentrate on the second derivatives of h^ u appearing in the scalars Tj and Tj and show 
that these are in the form of linear combinations (with S-dependent coefficients) of 26 different 
scalars N, made by contracting V nVuh pa with powers of S (including zeroth power which just 
gives the metric). Examples of such scalars are V pS7 a h pa and [S 13 ]^ [5 3 ] M ^ \7 p \7 a h /JV . and the 
exhaustive list 11 of them is given in Equations (5.5) to (5.14). Hence the linear combination 
(3.7) verifies 

3 26 

^ (m Tj + Vi Tj) ~ ^2 a^i , (3.9) 

i =0 i =0 

where the a* are scalars functions of Ui,Vi, S, JU and g pv that we will determine. Writing the 
equation (3.8) we get a priori 26 equations for the seven unknowns {t/j, Uj} by equating to zero 
each coefficient on of the Nj. We will however show that not all the scalars Nj are independent 
of each other, thanks to identities, syzygies, which can be obtained using again the Cayley 
Hamilton theorem applied to a suitable matrix, as we will explain (this is a consequence of the 
’’second fundamental theorem” of invariant theory [62-64]). These identities are gathered in 
appendix D and they are just enough to reduce to seven the number of equations to be solved 
to fulfill (3.8). This yields a unique solution for the coefficients {ui,Vi} which translates into 
the (we stress: off-shell) identity 


m 2 /?i e4 

4 


$0 ~ 


—e 3 Tq + e 2 T i 


ei T 2 + T 3 , 


(3.10) 


where here and in the rest of this paper, we do not write out the functional dependence of 
the e n when they depend only on S, i.e. it is to be understood that e n = e n (S) (otherwise we 
will make the functional dependence explicit). Hence, using the field equations, we get the 
scalar constraint 

m 2 /3i e 4 , 

- —- To — e 3 To + e 2 Ti — e\ T 2 + T 3 = 0 . (3.11) 

It can be seen that, when one sets g pv to be equal to the flat metric r/ pu , this constraint is 
just h = 0. This not only shows that one recovers the constraint (2.9), as one should, but 
also that the constraint (3.11) is independent of the vector constraints as it should. The flat 
space-time limit as well as the applications of these results to the case of Einstein and FLRW 
space-times are discussed in details in section 6. 


4 Linearized field equations for generic dRGT models 

In this section we obtain the linearized field equations for a generic model in the dRGT class 
(2.17) or equivalently the second variation of the action (in particular, we will not assume here 

11 All scalars made by contracting V^V^/ipo- with powers of S lower or equal to 3 either appear explicitly in 
the list or are ~ to scalar of this list as can be seen by commuting covariant derivatives acting on - which 
only creates terms ~ 0 containing Riemann tensors of the background. Scalars made by contracting V M V„/i pCT 
with powers of S larger or equal to 4 are linear combinations of those of the list as can be seen via the use of 
the Cayley-Hamilton theorem to replace such powers of S by linear combinations of cubic and lower powers 
of S. 
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the vanishing of the parameters fa and fa). As we recalled above, this requires in particular 
computing the generic variation of a square root (= S ) once the variation of $ (= S 2 ) 
itself is known. 

4.1 Varying the equations of motion 

We aim here at giving an explicit covariant expression for the linearized field equations of 
dRGT theories which read 

8E py = 5Q pu + m 2 5V pv = 0 . (4.1) 

where all quantities are computed at first order in the perturbation h pu of the dynamical 
metric g pu (henceforth providing a background over which the h P y propagate, and it is to be 
understood that all quantities that are not part of the perturbations are evaluated w.r.t. to 
this background solution g tw ), and the metric f )w is considered fixed (as it appears so in the 
action). Here, the variation of the Einstein tensor is given by, 

5Q„v = V a h pcT + l [g P „R pa - SffiR] V = £ p y pa h pa , (4.2) 

where, for later considerations, we have defined 

W = [dffiV 2 + cTVnV,, - 5 p V°Vy - 5 P V°V P - M pff V 2 + h pa , 

(4.3) 

which contains all terms with two derivatives acting directly on h pv . We also implicitly 
defined £ jW prT which we used in our discussions of the previous section. The variation of the 
interaction contribution (2.24) can be obtained using Eq. (2.23) and is given by, 

sv P y = i vn + \ g,As 2 ) p a \fot + fo(eii - s)}\ 

- 3 n k 

n =1 k =1 m= 1 

~ 7) 9fi\ \p\% + faefa + /3 3 (e 2 l + 5^)] s SS 5 U + (/i ++ v) , (4-4) 

where it is understood that S is given in terms of the background solution g pu and the chosen 
f P y (as opposed to SS p a and 5[S 2 ] p <r which also depends on h pu ). In order to arrive at the 
form above we have manipulated the sums in order to isolate all the terms which involve the 
explicit variation of the square root matrix S on the last row. All other terms either contain 
h p y directly or through 8[S 2 ] p a = -g px h XK [S 2 ] K a . 

Note that the fact that f pu is fixed suggests another way to go for the special case of the 
fa model. Indeed, in this case, the field equations can be rewritten as in Eq. (2.28). Squaring 
this expression, we can obtain the following expression for f pv 

fpu = Wi 9flU ~ ~ 6^) + ^2 (r w R p „ + - 3^) ’ ( 4 - 5 ) 
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and then use the fact that df fiv = 0 to obtain linearized field equations by writing the variation 
of the right hand side above, which is not very difficult to get and in particular much easier 
than linearizing S. We give in appendix E the corresponding expressions and check also there 
that the two ways to go indeed match, as they should. This requires the use of some non 
trivial syzygies which are given in appendix B.14. 

4.2 Variation of the square root matrix & the Cayley-Hamilton theorem 

In order to compute the variation of a square root matrix, dVl-P, once the variation of the 
matrix itself, 5S 2 , is known it is very useful to use the Cayley-Hamilton theorem. 12 The 
Cayley-Hamilton theorem is the statement that any square matrix (say dx d) itself obeys its 
characteristic (or secular) equation, i.e. the polynomial equation obeyed by its eigenvalues A 

p( A) = det(5 — Al) = 0. (4-6) 

The theorem therefore states that p(S ) = 0 and can be conveniently stated using the scalar 
functions ek{S ) as 

d 

]T(- l) n S d ~ n e n (S) = 0. (4.7) 

n =0 

For a 4 x 4 matrix S the Cayley-Hamilton theorem guarantees that, 

S 4 - ei S 3 + e 2 S 2 - e 3 S + e 4 l = 0. (4.8) 

An obvious virtue of this theorem is that it can be used to reduce any polynomial in S so that 
it contains no powers of S higher than three, a fact we make frequent use of in our analysis. 
Computing the variation of (4.7), using (2.23), we get, 

d— 1 - d n 

= - ]T ^2(-l) n+k e n - k (S)S d - n Tr[S k - 2 6S 2 ], (4.9) 

71=0 71=1 k =1 

This relation is very useful in computing 5S since the right hand side is given entirely in 
terms of 5S 2 . We may proceed somewhat further in the general case by splitting the sum 
on the left hand side into even and odd terms, but since we are mostly interested in dealing 
with d = 4 for now we will simply restrict to this case immediately. In d = 4, as can be seen 
starting from (4.8), only two terms contain odd powers of the matrix S, one of which is linear 
and the other one cubic. Writing S 3 = S 2 ■ S and isolating the terms with 5S we then obtain 
the following matrix equation for the variation, 13 

,.4 n 

[e 3 l + ei5 2 ]<55 = <55 2 [5 2 - ei S] + [S 2 + e 2 l]<55 2 - - ^(-l) n+fc eri _ fc 5 4 - n Tr[5 fc - 2 55 2 ]. 

n= 1 k =1 

(4.10) 

12 We thank S.F. Hassan for initial discussions about this point. 

13 We note that if we had chosen to write S 3 = S ■ S 2 we would have instead obtained an equivalent equation. 
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In order to completely isolate 5S we must assume that the matrix, 


X = e 3 l + ei<S 2 , 


(4.11) 


appearing on the left hand side of equation (4.10) is invertible, i.e. that det(X) ^ 0. This 
invertibility is equivalent to the statement that the matrices S and its negative —S have no 
common eigenvalues, as it is proven in appendix B. This matches the necessary and sufficient 
condition for the existence of a unique solution of the Sylvester equation (3.3) obeyed by 6S 
(see e.g. [61]) and we will assume here that this condition holds. Note that backgrounds 
for which this is not the case are such that the theory cannot be unambiguously linearized. 
When X is invertible, using again the Cayley-Hamilton theorem one can compute the inverse 


as, 


X” 1 


1 


3 


£(—irX 3 -e n (X), 

n =0 


(4.12) 


where we recall that e 4 (X) = det(X). With the assumed invertibility of X and using (4.10) 
we then arrive at the matrix relation, 


5S 



eiS] + [S 2 + e 2 t]5S 2 


1 4 m 

2 E E(- 1 ) m+fee — kS 4 - m Tr[S k ~ 2 8S 2 } 

m =1 k =1 


(4.13) 

Note that all terms on the right hand side are now given entirely in terms of 5S 2 and can 
therefore easily be computed in terms of metric perturbations using S 2 = g~ l f ■ Eq. (4.13) 
thus provides the general expression for the variation of the square root of a matrix in terms 
of the variation of the matrix itself. We note that this expression might be useful not only 
for computing perturbations in dRGT theories, but also in their bimetric extension or other 
generalizations. In the appendix B we provide a proof that the above expression (which, as 
far as we know, is new to this work) matches the one known in the mathematical literature 
and obtained from the Sylvester equation [61]. 


4.3 The general perturbation equations in dRGT theories. 

After having obtained the structure of the generic variation of 5S we return to the dRGT 
theories, for which d[S 2 ] p a = —g pX h\ K ,[S 2 ] K ' a . For our purposes it is convenient to first ma¬ 
nipulate the matrix prefactor X” 1 (c.f. Eq. (4.12)) even further. For any d x d matrix M we 
note the identity, 

k 
k 

This can be used to obtain e n (X) in terms of e n (S 2 ). From the definitions (2.20) of the e n 
and the Cayley-Hamilton theorem (4.7) it is then possible to prove that, 14 

ei(S 2 ) = e\-2e 2 , e 2 (S 2 ) = e|-2eie 3 + 2e 4 , e 3 (S 2 ) = e\ - 2e 2 e 4 , e^S 2 ) = e 2 . (4.15) 

14 Similar relations were noted in [65] in terms of the eigenvalues of triangular matrices but here we provide 
the completely covariant expressions in terms of the matrices themselves. 


A fc e fc (M) 


(4.14) 


( d 

e„(l + AM) = V 
fc=o 
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These relations can be used to show that, e.g. 


e 4 (X) = [efe 4 + e| - eie 2 e 3 ]' 


(4.16) 


along with similar relations for the remaining e n (X). Using these relations together with 
a binomial expansion of X 3 ~ n and a judicious use of the Cayley-Hamilton relation (4.8) to 
reduce all powers of S higher than three, we find that (4.12) can be written, 


x _i _ (e 3 - eie 2 ) 1 + ejS - eiS 2 
efe 4 + e\ — eie 2 e 3 


(4.17) 


Inserting this in (4.13), again making use of the Cayley-Hamilton theorem to reduce all powers 
of S higher than three, we arrive at 


5S X U 1 „ 

Sgpo 2 y 


e 4 Cl (Sffi + 5"5 P - g,^r) + e 4 c 2 (s^ + S?5 P - S^gT ~ 9„uS pa ) 
- e 3 ci (s p S° + 5°S pX ) + (e 2 c 4 - e 4 c 3 + e 3 c 2 ) S^S pa 

s;i s X + W S X - g pa [s 2 U + s p [s% + KisXp - gAS 2 r 


+ e 4 c 3 


- e 3c 2 [S£SZ + SZS p ) - e 3 c 3 ( SZ[SX + S P [SX + S P [SX + S °\ S Xp 


+ (e 3 c 3 - ei Cl ) {S^[S\ U + S^SX”) - (ci - e 2 c 3 ) ([SX^X + [SX\SXp 

+ c 4 [S\ v [SX a + Cl ([S%„S p ° + s^s 3 ]*”) + C 2 {[S 3 U[SX a + [S\AS 3 r 


+ C 3 [5 3 ]^[5 3 ]^ 

where the coefficients Cj are given by, 

e 3 — eie 2 


(4.18) 


ci = 


c 3 = 


p 2 

e i 


—eie 2 e 3 + e 2 + efe 4 ’ 2 — eie 2 e 3 + e 2 + e 2 e 4 


-ei 


-eie 2 e 3 + e 2 + e 2 e 4 


, c 4 — 


e 3 - ef 


-eie 2 e 3 + e 2 + e 2 e 4 


(4.19) 


and we recall that all the e n are functions of S and indices are raised and lowered using the 
background solution g pv . 

Let us summarize our results so far. Inserting (4.18) into (4.4) allows us to write the 
equations of motion (4.1) as, 


SE pv = SO,,,, + m 2 M^hpa . (4.20) 

where we recall that, 

SQ IW = £^ pa h pa + i [g pv R pa - SffiR] h pa , (4.21) 
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and the mass matrix M. pv prT is defined through, 


M pa = 8Vpu 

pu ~dg pa ’ 


(4.22) 


and given explicitly by, 

= \v,”K - \ (fhti + Ai (e^ - S ^)) [S 2 ]^ S p 

3 n k 

+ 4 E E E (-l) n+k+m Pne k - m [S n - k }\g„xg T ( p [S m y\ 


n =1 k =1 ra=l 


(4.23) 


1 < 55 ^ 

— 9 (Ai $ t \ + A2 ei 5 r A + A3 (e2 < 5 t a + [ 5 ,2 ] r A )) ——— + (// -h- z/). 

^ OQ per 


Together with the variation of the square root in (4.18) this can now be used to get the 
expression of 5E pv as an operator acting on the perturbation h /iu . This completes the general 
calculation of the equations of motion for perturbations in massive gravity with an arbitrary 
reference metric f pu and an arbitrary background solution g pu . Note that in these equations, 
f pv only arises through the presence of the tensor S fw , and one can in fact trade one for the 
other. Moreover, we recall that in the special case of the Ai model introduced above, one can 
explicitly express S pv as a function of g jJV and its curvature using equation (2.28), henceby 
obtaining in this case field equations for the metric perturbation h pv which only depend on g pu 
and its curvature. One can also check that to the lowest non trivial order (i.e. at linear order) 
in curvature, the field equations found here agree with the ones of [44] which investigates at 
this order the consistent coupling of a massive graviton to a curved background. 


4.4 Action for a massive graviton 

The above obtained equations of motion derive of course from an action obtained from the 
dRGT action expanded at quadratic order into the metric perturbation. This action, to be 
given below for completeness, can hence also be used as a starting point for a theory of 
a massive graviton, with the correct number of propagating polarizations on an arbitrary 
background (which we prove here in the case of the Ai model). 

In order to get the action for perturbations, the interaction term 

S m = —2M 2 m 2 J d. (4.24) 

can be expanded at quadratic order into the metric perturbations as 

s£» = -m 2 J . (4.25) 

where, 

dg^dgP a ~ = 2 + -^\fW\9^ v pcr , (4-26) 
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or 


where, 



d^xSg^ 


d 2 (VW\V ) 

9gpydg pcr 


^ 9per ) 


(4.27) 


- = \V\9\M^ + \V\g\g^V pa - \yM<rV va ~ \V\9\9^V vp . (4.28) 


depending on whether one uses 5g pi/ or Sg P y as independent variables. Note that the left hand 
side of the above equation (4.28) has been computed by Guarato & Durrer [65] who worked 
it out however in terms of the eigenvalues of an S that was taken to be in upper triangular 
form. We have verified that our results coincide with theirs for diagonal g pu and f pu . We 
note however that our expressions are explicitly covariant while this is not the case for those 
of Ref. [65] which uses a coordinate system where S is in triangular form. 15 

The relations (4.25) or (4.28) provide an explicit map between the second variation of the 
interaction term of the dRGT action and M p J >tJ which can be considered as a mass matrix. 
Note that this matrix depends in general (either implicitly, or explictly in the (3\ model where 
can be traded for R pv ) on the curvature of the background metric g pu . Computing now 
the full action (2.17) at quadratic order we get easily (using the background field equations 
as well as the above expressions) the following quadratic action 

= Am] J d 4 x^/\g\hpy + m 2 M p " p °) h pa , (4.29) 

where the expression for the kinetic tensor £P u P a has been provided before. This action, when 
varied, yield indeed the held equations (4.20). Note however that neither £P u P a nor M. pvpa 
as given in (4.2), (4.3) and (4.23) are manifestly symmetric in the pairwise index interchange 
(gv) -H- (per). The combination above however is indeed symmetric as can be seen after using 
the background equations to express everything either in terms of S or in terms of curvature. 

5 The /3i model & the fifth constraint 

The aim of this section is to show, in a covariant and Lagrangian way, that the above derived 
held equations indeed lead to a scalar constraint according to the general guidelines outlined 
in Sec. 3. We will do so for the /3i model introduced in Sec. 2.4, where any reference to the 
non dynamical metric f pu can be fully eliminated, and provide the details of the calculation 
for this case. 

15 The analysis done in the last part of [65] about FLRW space-time is however not correct, as we will discuss 
in more detail in section 6.3, since Ref. [65] does not work with valid background solutions. 
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We recall that the j3\ model is defined by /3 2 = Pz = 0. In this case the mass matrix 
■M-fj, u pcT is easily obtained from (4.23) and the equations of motion are given by, 


SEp,, = 5G P v+m 2 Po h/jv+mPPi e\ h^-mPPi h A (l/ S^ x - 


m 2 /3i 


A 3* 9W SS fj) 


9/j.u S\ hr a —m Pi 


*9, 


L p<J ? 


per 


(5.1) 

where the last term is obtained from (4.18). As discussed in Sec. 3, we search for a linear 
combination of the scalars and 4/* (with i < 3), i.e. for a set of coefficients such 

that (3.8) holds. An explicit calculation of the relevant scalars reveals that 


$0 ~ 
$1 ~ 
$2 r ^ J 




~(v p V a h^-Dh), 

\ [2 s^ V a V p K - S pv □ V - S ^ h + ei^o] , 

\ [2 [s 2 r v a v M k - [s 2 r □ v - p 2 r v,v M h +( e ? - 

\ [2 [5 3 ]^ V a V p K - [S 3 r □ v - P 3 r V,V M h+(e\- 


2e 2 ) 4>c 


(5.2a) 

(5.2b) 

(5.2c) 


3eie 2 + 3e 3 ) 4> 0 ( 5 .2d) 


where h = h p p . In order to compute the 'Iq efficiently we first note that the variation of the 
Bianchi identity, i.e. 5 (V M G P u) = 0, implies, 


V"' 6G IW - h pp V p G P u - Gau V p \fp +-Go 


V CT h - J G P ,o v, hr = 0 . (5.3) 


Using Eq. (2.26) (or equivalently (2.28)) to express above every occurence of the Einstein 
tensor G P ,v in terms if S pv , we then get the following expression for the 4q’s, where again we 
only retain terms with two derivatives acting on h pL/ , 




m 


M pa [5Ta V A V^ v + pi [S i+1 u v A v p hr - 4 [S i+1 u v A v ff h 


Pi 


+ Y s w [^Ta v a v, bT - (Po + Piei) [STx V A V P K 


(5.4) 


We stress here that we can view Eq. (2.26) (or equivalently (2.28)) just as a definition of S in 
terms of the curvatures of the metric g pv and that using this equation does not result in any 
restriction on the metric g pv . A close examination of the expressions above reveals that the 
4>i’s and the 4q’s are linear combinations of 26 different irreducible scalar terms, Kj, which 
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we denote in the following way (see footnote 11 ) 


J\ = VpV CT h pa , 
h = S pa V P V A h \, 

H 1 = [S 2 rV p V x h\, 

G\ = S pa S pv V P Vi, h au , 

^ = [s 2 r s^VpVphw, 

D 1 = [S 2 r [S^VpVnhw, 

Cx = [S 3 ) pa S pv V p \7 p h av , 

Bl = [5 3 ] pcr [S 2 ]^ VpVp h av , 

yl! = [5 3 ]^[5 3 rv p V^/» w , 


J 2 = □ /i, 

I 2 = S p<J V p V a h, 

H 2 = [S 2 rv p V a h, 

G 2 = S pa S pu V P V a h pu , 

E 2 = [5 3 ] p(T VpV CT /i, 

s 2 = [s 2 r v. 

n 2 = [5 2 ] pff [5 2 rVpV ff V> 

C 2 = [S 3 rS p "V p X7 a h pu , 

B 2 = [s 3 r [5 2 rvpV CT v. 

yi 2 = [s 3 ris 3 rv p v a h p „. 



(5.5) 

/ 3 = S pa □ h prJ , 

(5.6) 

H 3 = is 2 roh pa , 

(5.7) 


(5.8) 

F 3 = [s 3 roh pa , 

(5.9) 


Es = [S 2 r S pv V p V v h pa , 

(5.10) 

(5.11) 

c 3 = [s 3 r^v,v,v» 

(5.12) 

B 3 = [5 3 ] pCT [5 2 ]^VpV,V- 

(5.13) 

(5.14) 


For future reference, we order the M* from left to right and top to bottom, i.e. we define 
Mi = Ji, M 2 = J 2 , M 3 = Ji, M 4 = I 2 , efc. The expression of the Tj’s in terms of these are 
given by 


$0 ~ 

$1 ~ 


(J) 2 r^i 


<Jj>2 r\j 


~ (Ji ~ <h) , 

\ [2/i-/ 3 -/ 2 -ei(Ji- J 2 )] , 

^ [2 H x - H 3 - H 2 - (e 2 - 2e 2 ) (Ji - J 2 )] , 

- 2 Fi — F 3 — F 2 — (e 3 - 3 eie 2 + 3e 3 ) (Ji - J 2 ) , 


(5.15) 

(5.16) 

(5.17) 

(5.18) 


where we have used that Ji ~ S V Q V^ /i" after commuting derivatives together with similar 
relations for H\ and F\ . The expressions for the T.j in terms of the scalars Mj are given by 

T 0 = !n -^ 1 [-cie 4 (Ji - \ J 2 ) + (cie 3 - c 2 e 4 ) I\ - \cie 3 I 2 + \c 2 e± / 3 

—c 3 e 4 (2 ifi — ^H -2 — \H 3 ) + c 2 e 3 Gi — ^ (c 4 e 2 + C 2 e 3 — c 3 e 4 ) G 2 + 2c 3 e 3 Ei 
+| (ciei - c 3 e 3 ) [E 2 + E 3 ) + (ci - c 3 e 2 ) Di 

+\ (c 2 ei — ci + c 3 e 2 ) S 2 — \c\ [C 2 + C 3 ) — \c 2 [B 2 + B 3 ) — ^c 3 A 2 ] , (5.19) 


T 1 = m 2 1 [ — ■ci e 4 (7i — 5/2 — ^7 3 ) + cie 3 [H\ — ^H 2 ) + ^C 2 e 4 H 3 — C 2 e 4 G\ 

— \ (cie 3 — C 2 e 4 ) G 2 — c 3 e 4 [Fi — \F 2 — \F 3 ) 

+ (c 2 e 3 - c 3 e 4 ) (Ei - \E 2 - \E 3 ) + c 3 e 3 [D x - D 2 ) 

+c 3 e 3 [Ci — \C 2 — \ C 3 ) + (ci — c 3 e 2 ) [B\ — ^B 2 — |i? 3 )] , (5.20) 
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^2 = l c 3^4 (Jl ~ 5 ^ 2 ) — £36364 ( 2/1 — i /2 — 5 / 3 ) — (ci — 6362 ) e 4 ( 2 Hi — TjH -2 — \H 3 ) 

+£ 3^1 G \ + 5 (£164 — £363) G2 + (£163 + £264) (i 7 ! — ^2) + (£463 — 2636263 — £264) E \ 

(cie 3 - £36263 - £ 264 ) (E 2 + E 3 ) - (cie 2 - c 3 el + £ 364 ) T>i 
-5 (c 2 e 3 - £ie 2 + £362 - £ 364 ) D 2 

+^£364 (C 2 + C 3 ) — \c 3 e 3 (B 2 + B 3 ) — i (ci — £ 362 ) A 2 ] , (5.21) 

^3 = [— (cie 3 e 4 + £ 264 ) (Ji — \ J 2 ) + (fiiel + £26364 + £ 364 ) (l\ — \l 2 ) — \c 3 e\l 3 

- (cie2e 3 + 62e 2 e 4 + £3e 3 e 4 ) (Hi - \H 2 ) + \c 3 e 3 e 4 H 3 - £36364 (Gi - G 2 ) 

+ (cieie 3 - £ie 4 + c 2 eie 4 + c 3 e 2 e 4 ) (Fi - \F 2 ) + \ (c\ - c 3 e 2 ) e 4 F 3 

- (cie 4 - e 3 e| - c 3 e 2 e4) (E x - i ^ 2 - 5 ^ 3 ) + (ci - £ 362 ) e 3 Di 
~\ (cie 3 - £ 2 e 4 - 2c 3 e 2 e 3 ) D 2 

- (c 3 e 2 e 3 + c 2 e 4 ) C x -\ (ci - £ 3 e 2 ) e 3 (C 2 + C 3 ) 

- (cie 2 - £ 3 e| + £ 3 e 4 ) (Bi - i ^ 2 - 5 S 3 )] . (5.22) 

Using the above expressions for the <3?j and the 'hj one can obtain the explicit form of the 
coefficients a* as defined in Eq. (3.9). The corresponding expressions of the ct* are given in 
appendix C. Having obtained these expressions, it may seem now that solving equation (3.8) 
requires to solve 26 equations for 7 unknown ratios of the these 26 equations being 

obtained by setting to zero independently each scalar coefficient of each Nj appearing in the 
linear combination on the left hand side of (3.8). A closer examination of these equations 
shows that they reduce to a homogeneous system of 8 independent equations whose only 
solution is the trivial one (with vanishing Ui and v{). However, one notices that the N* of 
equations (5.5) to (5.14) are not all independent from each other but are linked by non trivial 
identities - syzygies - as we now show. These identities, as a consequence of the ’’second 
fundamental theorem” of invariant theory [62, 63], can be derived using the Cayley-Hamilton 
theorem which we already stated before. For clarity, let us rewrite here this theorem for an 
arbitrary 4x4 matrix M, as 

M 4 = ei (M)M 3 - e 2 (M)M 2 + e 3 (M)M - e 4 (M) 1. (5.23) 

One can then apply this to a matrix M built out of 4 arbitrary matrices A, B,C,D and four 
arbitrary real numbers {x,} in the form, 

M = xoA + x\B + x 2 C + x 3 D . (5.24) 

Now, because the {xi} as well as the matrices A, B , C, D are arbitrary, it means that in 
equation (5.23) the terms which have the same degree of homogeneity in the {x^} must each 
yield separate identities between the matrices A, B, C, D. Once these identities are obtained, 
one can replace in them A by h, B by S, C by S 2 and D by S 3 to get non trivial matrix 
syzygies, denoted here as = 0, between the tensors of interest. For our purpose here, 
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since we are interested in relations where appears linearly, it is enough to consider syzygies 
derived by looking at terms linear in xo (but this is the only restriction imposed on the x{). 
The corresponding syzygies [T^v = 0 are gathered in appendix D. The next step is to contract 
each of these syzygies with V^V 1 " and retain there only the terms with two derivatives acting 
directly on the perturbation h^ p . In the whole process, we use again the Cayley-Hamilton 
theorem to eliminate any powers of S greater than three. At the end of the process we obtain 
the following four independent identities from the corresponding equations (D.12)-(D.15), 


A\ — A 2 — e 2 {D i — D 2 ) — e3 {2F\ — F 2 — F 3 ) — (e 4 — eie 3 ) (G\ — G 2 ) 

o (5.25) 

— (e 4 — eie 3 ) {2H\ — H 2 — H 3 ) — e 2 e 3 (2/i — I 2 — I 3 ) + e 3 (Ji — J 2 ) ~ 0, 

2B\ — B 2 — B 3 — ei {D\ — D 2 ) + e 3 (Gi — G 2 ) — e 4 (2/i — I 2 - I 3 ) ~ 0 , (5.26) 

2C*i -C 2 -C 3 + (D 1 - D 2 ) + ei (2F 1 -F 2 - F 3 ) - (ef - e 2 ) {G x - G 2 ) . 

o (5.27) 

— e x (2 Hi — H 2 — H 3 ) + eie2 ( 27 i — I 2 — I 3 ) — {eie 3 + 64) (Ji — J 2 ) ~ 0 , 

2 E x -E 2 -E 3 + (2F\ -F 2 - F 3 ) - ei (Gi - G 2 ) - e, (2H l - H 2 - H 3 ) 2g 

+ e 2 (2/i — I 2 — I 3 ) — e 3 ( J\ — J 2 ) ~ 0 . 


Fortunately, these equations are precisely enough to change our original system of 8 (which 
only have the trivial solution Ui = V{ = 0) equations into the following system of 7 equations, 


ei v 0 - e 3 v 2 = 0 , 

(5.29) 

vo + e 3 v 3 = 0 , 

(5.30) 

v\ - e 2 v 3 = 0 , 

(5.31) 

m 2 j3i 

u 0 + 4 e 4 v 3 = 0 , 

(5.32) 

u\ = u 2 = u 3 = 0 . 

(5.33) 


We recall that we are only interested in 7 ratios of the {ui,Vi}, so this system has a unique 
solution. With this solution we find the identity (3.10) which in turns results on shell into 
the announced constraint 


m 2 Bi eA , 

-—-<h 0 - e 3 T 0 + e 2 'Fi - ei ^2 + ^3 = 0 . (5.34) 

We will below detail some applications of our results. In particular, looking at the flat space¬ 
time case will allow us to prove that the above constraint (5.34) is independent of the vector 
constraints (3.4) as it should in order to remove an extra degree of freedom. 
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6 Some applications 


6.1 Flat space-time limit 

We first look at the flat space-time limit of the above constraint (5.34). Choosing g^ v to be 
the canonical Minkowski metric r)^ v we get from (2.28) that in this case, one has 

^ v • <«■« 

And one obtains the field equations operator 


^Ef W — $Guu 


m 2 /3 0 


[Ah' 


(hfiv h g^iu) i 


( 6 . 2 ) 


which corresponds to (2.3) with the identification m 2 = —m 2 0o/3. This of course implies that 
one should impose 0o < 0 in order to have a healthy massive mode. Furthermore, because 
efc(c • 1) = c k ()|) one finds from (6.1) that 


ei = 


40o 

'30i 


e-2 = 


2 /?o 2 


m 


2 ’ 


S3 = 


27 0? 


3 ’ 


and in this case the constraint (5.34) reads 

2 00 


0 o 3 


540f 


d ll d v 6E >lv - 


m 


6 


rf’ 1 '5 = -m 


e 4 = 


$ 




■4 ’ 


64803 s 


810 i 


h = 0, 


(6.3) 


(6.4) 


where the first equality expresses the identity (3.10) (which holds here as stated above, i.e. 
without the neglecting of terms ~ 0) and the second one results from the field equation. 
Hence, ones finds back (2.6) and (2.7), as expected. Moreover, this shows that the constraint 
(5.34) is in the general case (i.e. not assuming anymore that g^ u is flat) independent of the 
vector constraints (3.4). 


6.2 Einstein space-times 

Let us study now the more general case of Einstein space-time backgrounds. Such a space-time 
obeys the following equations 


R 


fllS 


Gnu — A <7 


1 w 


R = 4A, 


(6.5) 

( 6 . 6 ) 


where A stands here for the background cosmological constant. Inserting these relation into 
(2.28) we get 




(6.7) 
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where A) is given by 


A) = A)- 2 ' (6-8) 

m z 

This means in particular (see equation (6.1)) that the relations (6.3) hold provided one re¬ 
places there A) by A) . It is easy to use the above relation (6.7) to deduce from the flat 
space-time discussion above that the field equations operator is given by 

SE )W = + A (h lw - h g^) . (6.9) 

This can of course also be checked by a direct calculation from Eqs. (4.20)-(4.23). Notice that 
the first two terms on the right hand side are just the linear expansion of Q fW + Awhile 
the last two terms have the flat space-time Fierz-Pauli form provided one replaces A> by A ). 
This structure of the massive spin-2 equations is well known and similarly to the flat space 
case the mass of the spin-2 field is identified by m| p = —m 2 (3 o/3. A direct calculation, or the 
use again of the previous results for flat space-time (together with the identity (5.3) which 
now contains a non trivial piece with undifferentiated h^ u ), yields the following form for the 
constraint (5.34) 

-M, (\7 fl \7 u 5En U - m 2 ^g flu SE flu ) = h (1 + = 0 . (6.10) 

54/3? \ ^ 6 y ,,v ) 648A 3 V Porn 2 J V ' 

Apart from the obvious general solution h = 0 there is now also a special case when the 
parameters satisfy 2A = —f3om 2 and the constraint equation is automatically satisfied and 
becomes an identity. This corresponds to the well known Higuchi bound 2A = 3mp P [45] 
and for this special case it is known that the theory has an enhanced linear gauge symmetry 
which renders the helicity-0 mode of the massive spin-2 field non-propagating [ 66 ]. This is 
however an artifact of linearization and it has been shown that the symmetry only exists for 
Einstein backgrounds and can not be extended to general backgrounds within the framework 
of dRGT theories [67-70]. 


6.3 FLRW backgrounds 

We now assume that g^ v parametrizes a Friedmann-Lemaitre-Robertson-Walker space-time 
in conformal time, with the line element given by 


dSg = g/ivdx^dx 1 ' = a 2 (rj ) [-dr? 2 + 'y i jdx l dx j ] . 

The corresponding non-vanishing components of S are obtained using (2.28) as 


Sn U = - 


S) = 


+ 


1 


A) 

3Al ' m 2 Pi a 2 


[~n 2 + 2 U' - K ) 


+ 


1 


Po_ 

3Al ' m 2 Pi a 2 


{n 2 + K ) 




( 6 . 11 ) 

( 6 . 12 ) 

(6.13) 
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all other components being zero, where A = 0, ±1 parametrizes the spatial curvature as 
usual, and A = a'/a is the Hubble factor of the conformal metric (6.11) (a prime denoting 
here a derivative with respect to the conformal time rf). For convenience we further define 


A) ^ i 

So — — 777;-b 


Si = 


3/?i m 2 /3i a 2 

A) + i 


3/3i m 2 /3± a 2 


(-a 2 + w! - a) , 

(a 2 + /f) , 


(6.14) 

(6.15) 


such that Sq = s o and S'® = s\5\. One also has the following useful relations 


ei = s 0 + 3si, e 2 = 3(s 0 + si)si, e 3 = (3s 0 + si)s 2 , e 4 = s 0 s? , (6.16) 

Squaring the tensor S we obtain as 


/oo — —a 2 Sq — —a 


t 2 2 2 

Jij = a s 1 t ij = a 


ft 


+ 


2/? 0 


9/32 3m 2 ^a 2 


/?o 2/3 q 


(A 2 - 2A' + A) + 


1 


m 4 /3f a 4 

1 /_.<} , a 2 


9/3 2 3m 2 /3 2 a 2 

We can then compute the mass matrix, 


(A 2 + AT) + 


(A 2 - 2A' + A)" 


m 4 /3 2 a 4 


(A 2 + A')' 




A4 0 o°° — /3 q + 3si/3i — 


m 2 a 2 


(A 2 + A) 


1 

2, 1 ' 


-V = F/3iSi7 2J = 


7*, 


» > 00 1 n 

■My = -/3is 0 7zj = 


i A+ 2^ (W2 + ^ 




- 5 


1 


A) + A 


2m 2 a 2 

So + 3soSi + 3s 2 x 

so + si 


7*i 


(6.17) 

(6.18) 

(6.19) 

( 6 . 20 ) 
( 6 . 21 ) 




1 j-m 2 a 2 $ - |/3 0 (A 2 + K) 4-4 (A 2 + A)" 

L_(A 2 + 2A' + A)+9 -2- ™ 2q2 -- 


A4 - M = 

*7 


2m 2 a : 


A + 2 A (^sq + 5si) 1 


4A'-A 771 - 2 ® 2 

3 


<f. 

A 5 

( 6 . 22 ) 


<*($) - 9 A Sl Ai7 


kl 


1 1 

~f> 0 + 2^2 


(3 (A 2 + A) + 4A') 


*(4 + 




2m 2 a 2 


7*17 
(6.23) 


Using the above expressions in those of sections 4.3 and 4.4 we obtain the equations of 
motion and action for a sound (as far as the counting of d.o.f. is concerned) massive graviton 
propagating in an arbitrary FLRW space-time. Notice that a similar problem was also studied 
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in Ref. [65] starting from a similar point as us. However, it was assumed there that both the 
dynamical metric g^ u and the non dynamical metric can be taken to be conformally flat. 
Here we have shown that this assumption is not correct. Indeed, assuming (6.11) leads to 
the form (6.17)-(6.18) for Said in another way, this means that the metrics considered 
in [65] are not solutions of the background field equations of the dRGT model, and it can in 
fact be checked that any such solutions where g )W and are proportional to each other via 
a coefficient c must be such that c is a constant. As a result, the theory found in [65] for a 
massive graviton on FLRW space-time is not the correct one obtained from expanding the 
dRGT model and hence very likely does not yield the correct number of propagating degrees 
of freedom. In constrast, in our case, one can check using in particular the relations (6.16), 
that the equation (5.34) is indeed a constraint. 

7 Conclusion 

In this paper we have given the detailed derivation of the field equations for a massive graviton 
propagating on an arbitrary background metric as obtained from the dRGT model. For the 
simplest of these models, we have shown how the non dynamical metric f which appears 
in the formulation of the dRGT theories can be explicitly eliminated out of these equations 
leaving just one metric g^ v which serves as a background. Starting from these equations, we 
have shown how to obtain a scalar constraint which plays there the role of the tracelessness 
of the massive Fierz-Pauli graviton obtained on flat backgrounds. We stress that, once one 
considers the linearized equations as expressed uniquely in terms of g^ w and its perturbation 
h^ u , the constraint is identically satisfied for an arbitrary g /w , i.e. irrespectively of the fact 
it is a solution of the background equations of motion. Hence, the obtained linear equations 
(respectively the obtained action) for h^ u can be considered as giving general equations of 
motion (respectively a general action) for a massive graviton with (at most) 5 polarizations 
on an arbitrary background, irrespectively from its origin in the dRGT model. It involves in 
particular a mass matrix which depends in an explicit and very specific way on the background 
curvature. This work can be extended in various directions. E.g. an obvious question is 
whether a similar constraint can be obtained when one does not assume that and vanish, 
i.e. in the general dRGT model [71]. Note further that massive gravity, including its dRGT 
version, suffers from various possible pathologies introduced in particular in the reviews [5, 6]. 
Some of these pathologies, which exact meaning and consequences are subject to a debate, 
would also arise in the linearized version of the theory we presented here. This concerns in 
particular the issue of superluminal propagation and related causality issues discussed e.g. 
in [37, 52-58, 72], and which can be seen at the level of the linearized theory and might 
invalidate some specific background. We refer the reader to these latter references as well as 
to the reviews [5, 6] for further discussion of these and other potential problems of massive 
gravity. 
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A An inductive proof of the variation 8e n 


Here, for completeness, we provide an inductive proof for (2.23). Starting from eo = 1 and, 

1 n 

e n (S) = —Y / (-l) k Tr[S k ]e n - k (S), n> 1, (A.l) 

n 

k =1 

the claim is that 

n 

5e n (S) = - ^(-l)*!^*- 1 ^ e n _ k (S) , n > 1. (A.2) 

k =1 


For n = 1 we find that this formula gives dei(.S') = Tr[<5<S], which is the correct result since 
ei(S’) = Tr[5]. Let us now assume that (A.2) is true for all k < n and compute the variation 
of 5e n -|_i directly from the defninition (A.l), 


n+l 


5e n+1 (S) = - —■y ]T(-l) fc |^Tr[(5S' fc ]e ri _|_i_fc + Tr [S k ]Se n+1 . k 

U 1 k =1 

^ n+l n+l —k i- 1 


^n+1— k—r 


k =1 m= 1 
n+ln+l —k 


-Tr[<55 fc ]Tr[5 m ] + -l+S+TrLS*] 
n + l — k m 


[-tE E (-l)‘ + ’”e„ + i_ fc _„Tr[«S l ]Ti[S* 


n + l 


n+l 


k =1 m= 1 


l 


l 

n + 1 — k k 


n+l—/c 


1 ) fc TV[5 fc ~ 1 <5g] ) £ (—l) m Tr[S' m ]e n+1 

n + 1 — k ' 


—k—m 


k =1 
n+l 


m=l 


^^(—l) fc Tr[S ,fc - 1 dS']e n+1 _fc . 


(A.3) 


fc=i 


To get to the second line we used (A.l) and (A.2) together with the identity mTr[S' m ~ 1 <5S'] = 
Tr[dS' m ]. To get to the third line we used that for an arbitrary function f{k,m ) one has 
z+fc=i Y^=\ k /+ rn ) = Ukii Y^=\ k /( m ; k)- We then simply combined everything and 
again used (A.l). 


B The Sylvester equation and another form for the square root variation 

A much studied matrix equation in mathematics is the Sylvester equation, 

AX — XB = C , (B.l) 
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where A, B and C are known square matrices and X is to be determined. The matrices A 
and B are matrices not necessarily of the same rank. The Sylvester theorem [74] states that 
for every matrix C, there is a unique solution for X if and only if cr(A) n cr(B) = 0 where 
<r(M) denotes the spectrum of M. In this case the solution for X is also a known polynomial 
in the matrices A, B and C given by [61] 

Tg k — 1 

X = <Tb\A) EE (-l) k e A _ k (B)A k - m ~ l CB m , (B.2) 

k= 1 m= 0 

where r# is the rank of the matrix B and qs{M) is the characteristic polynomial for B , i.e. 

d 

q B {M) = ]T(-1 ) n e n (B)M d ~ n . (B.3) 

n =0 

The connection to our work can be seen when considering equation (3.3) which shows that 
5S obeys a Sylvester equation with the identifications A = S, B = —S and C = 5S 2 = 5$. 
The Sylvester theorem then guarantees that there is a unique solution for 5S if and only if 
cr(S') n cr(—S) = 0, i.e. when S and —S share no common eigenvalues. The solution for 6S 
can then be written, 

4 fc-i 

5S = -X - 1 (—1 ) m e 4 _ k S k - m - 2 5S 2 S m , (B.4) 

k= 1 m= 0 

where X = 63 ! + e 1 S 2 as dehned in (4.11). The converse statement that S and —S have at 
least one common eigenvalue, i.e. a(S ) n <r(— S) / 0 is in fact equivalent to the statement 
that X is not invertible as we will now demonstrate explicitly. For this, consider A E er(S'), 
which means that it solves the characteristic equation of S, 

A 4 — eiA 3 + e 2 A 2 — esA + e\ = 0 . (B.5) 

If in addition A E cr(—S) we must also have that it solves the characteristic equation of —S, 

A 4 + eiA 3 + e 2 A 2 + e^X + e 4 = 0 , (B- 6 ) 


or equivalently, using (B.5), 

eiA 3 + esA = 0. (B-7) 

The solution A = 0 implies that e 4 = det S = 0, i.e. that S is not invertible which is a basic 
assumption in our setup so the second equation reduce to, 

e 3 + eiA 2 = 0. (B.8) 


The set of equations for A can now be written, 

A 4 + e 2 A 2 + e 4 = 0 , e 3 + eiA 2 = 0 . (B-9) 
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One solution of the second equation is given by e 4 = e 3 = 0. In this case it is obvious that 
X = e 4 + is not invertible since it is the zero matrix. The other solution is A 2 = —e 3 /e 4 
which result in the condition, 


e| - eie 2 e 3 + e 2 e 4 = 0 . 


(B.10) 


From Eq. (4.16) this condition can be seen to imply e 4 (X) = det X = 0, again implying that X 
is not invertible so we have shown that cr(S) 0 a(—S) / 0 implies that e 4 (X) = 0. Conversely, 
from Eq. (4.16) it is clear that e 4 (X) = 0 implies, 


e| - e 4 e 2 e 3 + e 2 e 4 = 0. 


(B.ll) 


Obviously, by the assumption that S is invertible we have that e 4 / 0. Hence in this equation 
e 4 = 0 e 3 = 0 from which it follows that any A € cr(S) is also in a(—S). For e 4 / 0 we define 
A 2 = —e 3 /e 4 and it is then again clear that this A will be in both cr(S) and a(—S). We have 
thus established the equivalence between the statements e 4 (X) = 0 and cr(S) 0 a(—S) =/= 0. 

We also explicitly verify below that the solution of the Sylvester equation for 5S given 
by (B.4) coincides with the expression for 5S given in (4.13) and that we derived earlier. 
We denote 5S\$ the expression (B.4) and still use 5S for the expression (4.13). In order to 
simplify the expressions, we look at the difference 2SX (dS 1 ^ — dS 1 ) S and obtain that (where 
we use notations introduced in appendix D) 


[2SX (6S\ S - 5S) SY v = — e 4 ([G^ - \G' 2 f v + 2 [H[f v - [H' 2 \\ - [H'^ v ) 

+e 3 (2 [E[f v - [E' 2 \\ - [E' 3 r u ) 

-C2 {[D[r v - [D' 2 f v ) + ([A[r u - [A' 2 y u ) , (B.12) 


The difference (B.12) can now be rewritten as a function of equations given in appendix D, 


[2SX (SS\ s - 5S) S]* = (D.12) + e 3 (D.15) = 0. (B.13) 

As we assume X and S to be invertible this implies, 


5S |s = 5S. 


(B.14) 
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C Expression of the coefficients a* 


In this appendix, we give the list of all the coefficients as defined from Eq. (3.9) and 
computed using the expressions (5.15)-(5.18) and (5.19)-(5.22) given in the main text. 

a± = -u 0 - ~ei u\-\(e\ — 2 e 2 ) u 2 - \ (e\ — 3eie 2 + 3e 3 ) u 3 

+ [ _c i e 4 + c 3 e\ v 2 - (cie 3 e 4 + c 2 ef) v 3 ] 

a 2 = u 0 + \e\ u\ + \ (el — 2 e 2 ) u 2 + \ (ef — 3eie 2 + 3e 3 ) u 3 

+ [ Cie4 v ° ~ Cse 4 ^2 + (cie 3 e 4 + c 2 e 4 ) u 3 ] 

o 3 = m + m ^ 1 [( c i e 3 - c 2 e 4 ) v 0 - cie 4 ui - 2c 3 e 3 e 4 v 2 + (cie| + c 2 e 3 e 4 + c 3 e 4 ) u 3 ] 

1 m 2 P1 r / 2 2\ i 

a 4 = - -u i H-— [-cie 3 v 0 + cie 4 v x + c 3 e 3 e 4 v 2 - {c x e 3 + c 2 e 3 e 4 + c 3 e 4 ) u 3 J 

1 m 2 /3i r 2 i 

a 5 = - -ui H-— |c 2 e 4 v 0 + cie 4 v x + c 3 e 3 e 4 v 2 - c 3 e 4 u 3 J 


06 — U 2 + 


m 2 Pi 


[-2c 3 e 4 vq + c\e 3 v x - 2 (c 4 - c 3 e 2 ) e 4 v 2 - (cie 2 e 3 + c 2 e 2 e 4 + c 3 e 3 e 4 ) u 3 ] 


o 7 = - \u 2 + m f' [c 3 e 4 v 0 - cie 3 iq + (c 4 - c 3 e 2 ) e 4 u 2 + (cie 2 e 3 + c 2 e 2 e 4 + c 3 e 3 e 4 ) u 3 ] 


1 


4 

m 2 Pi 


os — — 2^2 + ^ 


[c 3 e 4 uq + c 2 e 4 v\ + (ci — c 3 e 2 ) e 4 v 2 + c 3 e 3 e 4 u 3 ] 


Og — 

«io = 


m 2 Pi 

2 

m 2 Pi 


[c 2 e 3 vq - c 2 e 4 iq + c 3 e| u 2 - c 3 e 3 e 4 u 3 ] 

[- (cie 2 + c 2 e 3 - c 3 e 4 ) u 0 - (cie 3 - c 2 e 4 ) iq + (cie 4 - c 3 e|) u 2 + 2c 3 e 3 e 4 v 3 ] 


Oil — ^3 + 


m 2 Pi 


[—c 3 e 4 iq + (cie 3 + c 2 e 4 ) u 2 + (cieie 3 - cie 4 + c 2 eie 4 + c 3 e 2 e 4 ) u 3 ] 


o i2 = - ^u 3 + m /' [c 3 e 4 iq - (cie 3 + c 2 e 4 ) u 2 - (cieie 3 - cie 4 + c 2 eie 4 + c 3 e 2 e 4 ) u 3 ] 


1 

oi 3 = - t; u 3 + 


4 

m 2 Pi 


[c 3 e 4 iq + (cie 4 - c 3 e 2 e 4 ) u 3 ] 


«i 4 — 


015 — 


016 — 


m 2 Pi 


m 2 Pi 


m 2 Pi 


[2c 3 e 3 u 0 + (c 2 e 3 - c 3 e 4 ) iq + (cie 3 - 2c 3 e 2 e 3 - c 2 e 4 ) u 2 
- (cie 4 - c 3 e| - c 3 e 2 e 4 ) u 3 ] 

[(ciei - c 3 e 3 ) v 0 - (c 2 e 3 - c 3 e 4 ) v x - (cie 3 - c 3 e 2 e 3 - c 2 e 4 ) u 2 
+ (cie 4 - c 3 e 2 - c 3 e 2 e 4 ) u 3 ] 

[(ciei - c 3 e 3 ) v 0 - (c 2 e 3 - c 3 e 4 ) iq - (cie 3 - c 3 e 2 e 3 - c 2 e 4 ) v 2 
+ (cie 4 - c 3 e 2 - c 3 e 2 e 4 ) u 3 ] 
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m 2 /?! 
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m 2 f3i 
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m 2 ^i 
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[(ci - c 3 e 2 ) v 0 + c 3 e 3 vi - (cie 2 - c 3 e 2 + c 3 e 4 ) v 2 + (cie 3 - c 3 e 2 e 3 ) v 3 ] 

[(c 2 ei - ci + c 3 e 2 ) v 0 - 2 c 3 e 3 vi - (c 2 e 3 - cie 2 + c 3 e2 - c 3 e 4 ) v 2 
- (c 4 e 3 - c 2 e 4 - 2 c 3 e 2 e 3 ) v 3 ] 

[c 3 e 3 v 4 - (c 3 e 2 e 3 + c 2 e 4 ) v 3 ] 


m 2 /3i 

4 


[—ci i ; 0 


C3e3 Vi + c 3 e 4 W2 - (cie 3 - c 3 e 2 e 3 ) v 3 ] 


m 2 f5i 

4 

m 2 /3i 

2 

m 2 /3i 

4 

m 2 /3i 

4 


[~ci v 0 - c 3 e 3 vi + c 3 e 4 v 2 - (cie 3 - c 3 e 2 e 3 ) v 3 ] 

[(ci - c 3 e 2 ) vi - (cie 2 - c 3 e2 + c 3 e 4 ) v 3 ] 

[ c 2 vo - (ci - c 3 e 2 ) Vi - c 3 e 3 v 2 + (c 4 e 2 - c 3 e| + c 3 e 4 ) v 3 ] 
[-c 2 v 0 - (ci - c 3 e 2 ) v 4 - c 3 e 3 v 2 + (cie 2 - c 3 e| + c 3 e 4 ) v 3 ] 


0 


m 2 /3i 

4 


[c 3 ^o + (ci - c 3 e 2 ) v 2 ] 


D Syzygies linear in h pu 

Here we gather the syzygies which we obtained as explained below equation (5.24) which are 
linear in h pu or its second derivatives. First, we define the following 26 tensors 
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= h^, 




IT 


= <rh, 







(D.l) 

vt - 

= S^ a h\ 
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[iT - 

= S^h, 



[iT - 

= g rs pa V, 

(D.2) 

[H[ 

j/xi/ 

= [S 2 ]^ 

h\ 
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[a 2 

l/ii/ 

II 

to 

a 

h, 


[H' 3 

j/ii/ 

b 

Q. 

CN 

a 

a. 

II 

h P G i 

(D.3) 

[<?i. 

JUI/ 

= S w S l 

h 

,l pG 5 


[<%. 

|/ii/ 

= s^ S^h pa , 
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[F[] 
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= [S 3 ]^ 

h\ 



T] 

/ii/ 

= [S 3 ]^ 

h, 


[Fs] 

/ii/ 

= g **" [S 3 ]^ 

h P G i 
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S va 
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/in 

II 

Fo 

to 
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hpG 5 
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/ii/ 

= [S 2 ]^ 

h P G i 
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/ii/ 
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hpG 5 
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/ii/ 
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VG 
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\ pa h 
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/ii/ 
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h P G i 
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/ii/ 

= [5 3 ]^ 

[5 3 ] 

P° h 
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(D.10) 


from which the scalars (5.5)-(5.14) are obtained by contraction with V^Vi/ and letting the 
derivative only hit h pa . Using the method explained below Eq. (5.24), we extract the following 
10 syzygies from the Cayley-Hamilton relation, using the matrix (5.24) (with the replacement 
[A]^ v = [B]^ v = [5]^, [C^u = [S 2 ] At j/ and v = [5 3 ] M V ) and keeping only the terms 
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linear in xq. 

From xq x\ X2 x 3 we get 


4 {lA[r„ - [. A' 2 r u ) + ei (2 [B[f v - [B' 2 f v - [3']^) + ef (2 [C7f]^ - [C'j^ - [Ctf v ) 
- 4e 2 ([^ - [D' 2 f v ) + (—ef + ei e 2 + e 3 ) (2 [E[f v - [. E' 2 ]* v - [E'^ v ) 

+ (-3e 3 + e ie2 ) (2 [F[f v - [F'}^ - [F']^) + (-4e 4 + 4 ei e 3 ) {[G 1 ^ - [G' 2 ]^) 

+ (—4e 4 + 3e 4 e 3 - efe 2 ) (2 [H[]* v - [H^ v - [H^ v ) 

+ (—e 4 e 4 - 3e 2 e 3 + ei e 2 2 ) (2 [l[]^ - [l' 2 f v - %}* v ) 

+ (—efe 4 + 3e 3 - e 4 e 2 e 3 ) - [J 2 fj = 0 . 

From Xu x 2 i we get 


ei (2 [B[)\ - [B' 2 \\ - [B' 3 ]\) - el ([Dif„ - \D' 2 f u ) + e 3 (2 [£(]<•„ - [£(]"„ - [Eff„) 
+ e 3 (2 - [F5r„ - [*$]<*„) - erea (2 \H[]\ - [«$]"„ - [Ef]"„) 

+ (-eie, + e 2 e 3 ) (2 [/([>•„ - [/(]*„ - [iff,,) - ef ([/,f „ - [Jfr„) = 0 . 

From x 3 x 3 x 3 we get 


(2 [Bin - \Bl r„ - ra-e) + e, (2 [Cff,, - [C“ 2 f „ - [CJ]",,) - 2e 2 (2 [Ef]",, - [£(]*, - [E' 3 ]\) 
+ (-2e 2 + e\) (2 [F{}^„ - [EfFV “ + ( e 3 + 3eie 2 - ef) ([Gf]^ - [Gf]^) 

+ (2e 2 e 2 - ef) (2 [#]"„ - [Eff„ - [Ef]"J + (e?e 2 - 2ef - e 4 ) (2 [/;]"„ - [/(]"„ - [Jif„) 

+ (~efe 3 + 2e 2 e 3 - eie 4 ) ([Jj]'V - [■t 2 ] , ',j - 0. 


From ZD £1 gr 2 we get 

2 (2 [£?;]"„ - [Bl\\ - [Ef]"„) + e t (2 [Cf]^ - [Cf]",, - [C' 3 ]\) - e t ([£fr„ - [Ef]' '„) 
+ (~ef + e 2 ) (2 [Ef]", - [Ef ]<*„ - [Ef]"J + e 2 (2 [Ef]",, - [Ef ]"„ - [Eff„) 

+ 2e 3 ([Gf]*, - [Gf]",) - ei e 2 (2 [E[p„ - [Ef]", - [£(]"„) 

+ (-264 + ef) (2 [/(]<*„ - [/(]"„ - + (—eie 4 - e 2 e 3 ) ([J(r„ - [/(]"„) = 0. 


From Xu x 3 x 3 we get 


2 (2 [CJf„ - [C' 3 f„ - [CSf „) + 2 ([D',]e„ - [Ef]*,) - e t (2 [EfP„ - [E(f„ - [Eff„) 
+ e, (2 [E[p„ - [£(]<•„ - [Ef]",,) + (-ef + 2e 2 ) ([Gf]*, - [Off J 
- ef (2 [E;]"„ - [Eff„ - [Ef]"„) + ei e 2 (2 [/!]"„ - [/(]"„ - [/ff„) 

+ (-2e 4 -eie 3 )([j;r i ,-[Ji]"J=0. 

From x 3 g’ 3 3 we get 


(2 [e;]"„ - [El ]<■„ - [Ef]*,) + (2 [F'i\\ - [Ef]<*„ - [Ef]*,) - e t ([Gf]*, - [Of ]*„) 

- er (2 [E[f„ - [Ef]*„ - [Ef]*„) + e 2 (2 [£(]"„ - [/(]*, - [/(]"„) - e 3 ([J!]"„ - [j(]«J = 0 . 
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From xq x\ x 3 we get 


3C! {[A[f v - [J^f v ) - 2e 2 (2 [B[y„ - [B' 2 f v - [B' 3 f u ) + ei e 2 (2 [Ctf v - [C' 2 f v - [C' 3 f v ) 

+ (—e 4 + ei e 3 + 4- e 2 e 2 ) (2 [F^ - [E' 2 f v - [F']^) 

+ (—e 4 - 2e 4 e 3 + e|) (2 [F[f v - [F^ v - [F^ v ) 

+ (—2e 4 e 4 — 2e 2 e 3 + 2e 4 e 3 ) {[G'-^ v — [G^]^) 

+ (—2e 4 e 4 + 2e 4 e 3 - e ie |) (2 - [H' 2 ]\ - [F']^) 

+ (e 2 e 4 - 2e 4 e 2 e 3 + e 3 ) (2 [l[}^ - [l']^ - [I^„) 

+ (e 3 e 4 — e 4 e 2 e 4 + 2e 4 e| - e|e 3 ) {[JiY u — [J^YS) = 0 • 

From xi} x 2 2 13 we get 

2d ([4f„ - [^l^) + (e\ - e 2 ) (2 [B[]\ - [B ' 2 ]<*„ - (B^) + e 3 (2 [(?;]"„ - - [(?,]"„) 

+ (e 3 - e 4 e 2 - e 3 ) {[D[Y U - [F']^) + (2 ei e 3 - 2e 4 ) (2 [F^, - [F']^ - [F'f,) 

+ (—2e 4 + ei e 3 ) (2 [F[f„ - [F']^ - [F']^) - e?e 3 (2 [H[f v - [H' 2 f v - [H' 3 f v ) 

+ (-e 2 e 4 - efe 4 + e 4 e 2 e 3 ) (2 [I[Y v - [I 2 Yu ~ Y'YY u) + ( e 3 e 4 - e 4 e 3 ) {[J[Yu - [J 2 YY) = 0 • 


From xq x 2 x 3 2 we get 

(-2d + 3ef) ([Ai]", - [A' 2 y„) - dd (2 [^f„ - \B' 2 \\ - [B^]^) 

+ (-2e 4 + 2dd) (2 [C(]<*„ - \C 2 ]\ - [C' 3 ]<‘„) 

+ (—2e 4 + 2e 4 e 3 + 2e| — 2e 4 e 2 ) {[F>[Yu — [F^j,) 

+ (e?e 3 - e 2 e 3 - e 4 e 4 ) (2 [E[Y V - [E' 2 \\ - [F']^) 

+ (-3e ie4 + e 2 e 3 ) (2 [F[f v - [F']^ - [F']^) + (2e 2 e 4 - ei e 2 e 3 ) (2 [H[Y V - [H^ - [H' 3 ]^) 
+ (—2e 4 e 2 e 4 + e|e 3 ) (2 [/']**„ - [/']**„ - [/']**„) + (2e| + ei e 3 e 4 - e 2 e|) - [J^) = 0 . 

From xq x 3 3 we get 


( e 3 _ eie2 _ e 3 ) ([Ai]^ - [A' 2 Y U ) + (-e 4 + e 4 e 3 + e| - e?e 2 ) (2 [B[Y, - [B^ - [F']^) 

+ (—e 4 e 4 - e 2 e 3 + e?e 3 ) (2 [Ctf v - [C' 2 f v - [Ctf v ) + {~e\e 4 + el) (2 [F^ - [F']^ - [F']^) 
+ (e 3 e 4 + e 4 e 2 e 4 - e ie |) (2 [#']**„ - [H' 2 f v - [H' 3 f v ) 

+ {e 2 4 - e 1 e 3 e 4 - e|e 4 + e 2 e|) (2 [I[Y U - [I'^ x v - [I 3 f v ) 

+ (e 4 e| + e 2 e 3 e 4 - e^) {[J'if v - [J 2 Y V ) = 0 . 

(D.ll) 
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Here we find that only the four following syzygies are independent 


iAr* - [4,r„ - e 2 - nr.) - « - [f 2 y„ - Kr„) 

- (et - eies) ([Gif, - - (e 4 - de 3 ) (2[H;f„ - M', - KfJ 

- e 2 e 3 (2[/;]"„ - \I' 2 }\ - [J$f„) + el ([- [/ 2 r„) = 0, 

2[B'iF„ - ~ - d ([Dir„ - MM + e 3 ([Glr„ - [G' 2 ]"„) 

-ei (21/m - - [4F,,) =0, 

2[c;r„ - Kr„ - [csr„ + ([o;i^ - [d'fj+ ei (2[Ffr„ - kf„ - Kr„) 

- (4 - d) ([G' 1 ]'‘„ - - e\ (2[ffiF„ - [H' 2 )\ - [HtfJ 

+ e 2 e 2 (2 [I[]\ - \I' 2 ]\ - [JSr„) - (eie 3 + e 4 ) (W]"„ - Kr„) = 0, 

- {ET„ + (2 [f;i \ - [Fiw - KF„) - d ([G;]^ - (cy^) 

- d (2[ff[F„ - {HT„ - W,) + e 2 (2[/;r„ - [r 2 F„ - [4]<g - e 3 ([J[Y„ - IJT„) = o. 

(D.16) 

By contracting these equations with V^V 1 ' we obtained the 4 relations (5.25)-(5.28) of the 
main text. 

E Another way to the linearized field equations 

We could as well have derived the field equations directly from the background equation in 
the form (2.28) 

spu= ~Wi 9 ^ + riWi { R ' w ~ e 9fluR )' (K1) 

Indeed we can read this equation as expressing the metric /)„, as function of the metric g^ u 
and its curvature as 

flw = ~ ( R ' JM ~ « R9lJU ) + (^ pjRP " + 36^" ~ 3 RR ^) ’ (K2) 

and then use the fact that the variation 5f^ u of / Mi , vanishes, since f^ u is not a dynamical 
variable, to obtain the field equations as an equation for 5G^ U taking the form 

A t J )a 8G p(J = T IJV , (E.3) 

with 

V PCT = |^ 2 /5o (dffi - - 2 R[^ v) + ±RS£% + l^r - \r 9iiv 9 p ° , (E.4) 


(D.12) 

(D.13) 

(D.14) 
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and 


T lw = ^m 4 / 3 0 2 h pu - ^ m 2 /3o R pa h pa g^v + ^m 2 foRhg pv - ^m 2 (3 0 Rh pu + ^R pa h pf7 R pu 

(E.5) 

- ^h R, R. pu - ^RR pa h pcr g pi/ + -^R 2 hg pv + RR p ^h u)p - R 2 h pv - R p p R^ h pa . 

We then invert formally the operator A defining B that verifies 



(E.6) 

This yields 


5G pv = B^T pa . 

(E.7) 

which should agree with the field equations (4.20) obtained previously where we 
mass matrix as, 

identify the 

m 2 M pv pa h pa = -B pv p °T pc . 

(E.8) 

Thus the following identity should hold: 


T pu = -m 2 A^M pa ^h Xl . 

(E.9) 


To check it, we first use once again the background equation to write everything in terms of 
S and find that A can be written in the simple form, 


A P a 


= m 2 ft (-2 Sffi + |W”) 


(E.10) 


We then use the expression of A4 in the j3i model, reduce all powers of S greater than three 
and use the equations (D.12)-(D.15) to finally get that the equation (E.9) is indeed verified, 
both sides of this equation being, 

T pu = —m 2 A pa M pa Xl h\j 


= rrrfii 


2 m 2 (/3 0 + /?iei) S^h u ) p - -m 2 (/? 0 + /3iei) hS pv - m 2 /3 1 S p S°h pa 


+ 2 -m 2 hS pa h pa S pu 


(E.ll) 
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